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Abstract — A problem of index coding with side information was 
first considered by Y. Birk and T. Kol (IEEE INFOCOM, 1998). In 
the present work, a generalization of index coding scheme, where 
transmitted symbols are subject to errors, is studied. Error- 
correcting methods for such a scheme, and their parameters, are 
investigated. In particular, the following question is discussed: 
given the side information hypergraph of index coding scheme 
and the maximal number of erroneous symbols S, what is the 
shortest length of a linear index code, such that every receiver 
is able to recover the required information? This question turns 
out to be a generalization of the problem of finding a shortest- 
length error-correcting code with a prescribed error-correcting 
capability in the classical coding theory. 

The Singleton bound and two other bounds, referred to as 
the a-bound and the K-bound, for the optimal length of a 
linear error-correcting index code (ECIC) are established. For 
large alphabets, a construction based on concatenation of an 
optimal index code with an MDS classical code, is shown to 
attain the Singleton bound. For smaller alphabets, however, this 
construction may not be optimal. A random construction is also 
analyzed. It yields another inexplicit bound on the length of an 
optimal linear ECIC. 

Further, the problem of error-correcting decoding by a linear 
ECIC is studied. It is shown that in order to decode correctly 
the desired symbol, the decoder is required to find one of the 
vectors, belonging to an afflne space containing the actual error 
vector. The syndrome decoding is shown to produce the correct 
output if the weight of the error pattern is less or equal to the 
error-correcting capability of the corresponding ECIC. 

Finally, the notion of static ECIC, which is suitable for use with 
a family of instances of an index coding problem, is introduced. 
Several bounds on the length of static ECIC's are derived, and 
constructions for static ECIC's are discussed. Connections of 
these codes to weakly resilient Boolean functions are established. 

Index Terms — index coding, network coding, side information, 
error correction, minimum distance, broadcast. 
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THe problem of Index Coding with Side Information 
(ICSI) was introduced by Birk and Kol IH, S. During 
the transmission, each client might miss a certain part of the 
data, due to intermittent reception, limited storage capacity or 
any other reasons. Via a slow backward channel, the clients 
let the server know which messages they already have in their 
possession, and which messages they are interested to receive. 
The server has to find a way to deliver to each client all 
the messages he requested, yet spending a minimum number 
of transmissions. As it was shown in [l], the server can 
significantly reduce the number of transmissions by coding 
the messages. 

The toy example in Figure 1 presents a scenario with 
one broadcast transmitter and four receivers. Each receiver 
requires a different information packet (we sometimes simply 
call it message). The naive approach requires four separate 
transmissions, one transmission per an information packet. 
However, by exploiting the knowledge on the subsets of 
messages that clients already have, and by using coding of 
the transmitted data, the server can just broadcast one coded 
packet. 



has X2,X3, X4 

requests xi 



has xi, X2,X3 
requests X4 




has xi,X3, Xi 
requests X2 

Fig. 1: An example of the ICSI problem 



has xi,X2, Xi 
requests 



Possible applications of index coding include communica- 
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tions scenarios, in which a satellite or a server broadcasts a set 
of messages to a set clients, such as daily newspaper delivery 
or video-on-demand. Index coding with side information can 
also be used in opportunistic wireless networks. These are the 
networks in which a wireless node can opportunistically listen 
to the wireless channel. The client may obtain packets that are 
not designated to it (see ||3|-||5l). As a result, a node obtains 
some side information about the transmitted data. Exploiting 
this additional knowledge may help to increase the throughput 
of the system. 

The ICS! problem has been a subject of several recent 
studies 13], ll6ll- lfT3]| . This problem can be viewed as a special 
case of the Network Coding (NC) problem (HD, |[T3]. In 
particular, as it was shown in (3], (IT], every instance of 
the NC problem can be reduced to an instance of the ICSI 
problem. 

B. Our contribution 

The preceding works on the ICSI problem consider scenario 
where the transmissions are error-free. In practice, of course, 
this might not be the case. In this work, we assume that the 
transmitted symbols are subject to errors. We extend some 
known results on index coding to a case where any receiver 
can correct up to a certain number of errors. It turns out that 
the problem of designing such error-correcting index codes 
(ECIC's) naturally generalizes the problem of constructing 
classical error-correcting codes. 

More specifically, assume that the number of messages that 
the server possesses is n, and that the designed maximal num- 
ber of errors is 6. We show that the problem of constructing 
ECIC of minimal possible length is equivalent to the problem 
of constructing a matrix L which has n rows and the minimal 
possible number of columns, such that 

wt (zL) > 2(5 + 1 for all z el, 

where T is a certain subset of F^\{0}. Here wt(a;) denotes the 
Hamming weight of the vector x, ¥q stands for a finite field 
with q elements, and is the all-zeros vector If I = FJ^'\{0}, 
this problem becomes equivalent to the problem of designing 
a shortest-length linear code of given dimension and minimum 
distance. 

In this work, we establish an upper bound (the K-bound) and 
a lower bound (the a-bound) on the shortest length of a linear 
ECIC, which is able to correct any error pattern of size up to 
S. More specifically, let H be the side information hypergraph 
that describes the instance of the ICSI problem. Let J\fq[H, 6] 
denote the length of a shortest-length linear ECIC over F^, 
such that every Ri can recover the desired message, if the 
number of errors is at most S. We use notation Ng[k, d] for the 
length of an optimal linear error-correcting code of dimension 
k and minimum distance d over F^. We obtain 

Ng[a{n),25+1]<M,['H,S] < Ng[K,{n),25 + 1], (1) 

where a{H) is the generalized independence number and 
Kq{H) is the min-rank (over Fg) of H. 

For linear index codes, we also derive an analog of the 
Singleton bound. This result implies that (over sufficiently 



large alphabet) the concatenation of a standard MDS error- 
correcting code with an optimal linear index code yields an 
optimal linear error-correcting index code. Finally, we consider 
random ECIC's. By analyzing its parameters, we obtain an 
upper bound on its length. 

When the side information hypergraph is a pentagon, and 
(5 = 2, the inequalities in ([T]l are shown to be strict. This 
implies that a concatenated scheme based on a classical error- 
correcting code and on a linear non-error-correcting index code 
does not necessarily yield an optimal linear error-correcting 
index code. Since ICSI problem can also be viewed as a source 
coding problem 16], [13|, this example demonstrates that 
sometimes designing a single code for both source and channel 
coding can result in a smaller number of transmissions. 

The decoding of a linear ECIC is somewhat different from 
that of a classical error-correcting code. There is no longer a 
need for a complete recovery of the whole information vector. 
We analyze the decoding criteria for the ECIC's and show 
that the syndrome decoding, which might be different for each 
receiver, results in a correct result, provided that the number 
of errors does not exceed the error-correcting capability of the 
code. 

An ECIC is called static under a family of instances of 
the ICSI problem if it works for all of these instances. Such 
an ECIC is interesting since it remains useful as long as the 
parameters of the problem vary within a particular range. 
Bounds and constructions for static ECIC's are studied in 
Section [Villi Connections between static ECIC's and weakly 
resilient vectorial Boolean functions are also discussed. 

The problem of error correction for NC was studied in 
several previous works. However, these results are not directly 
applicable to the ICSI problem. First, there is only a very 
limited variety of results for non-multicast networks in the ex- 
isting literature. The ICSI problem, however, is a special case 
of the non-multicast NC problem. Second, the ICSI problem 
can be modeled by the NC scenario [3 |, yet, this requires that 
there are directed edges from particular sources to each sink, 
which provide the side information. The symbols transmitted 
on these special edges are not allowed to be corrupted. By 
contrast, for error-correcting NC, symbols transmitted on all 
edges can be corrupted. 

The paper is organized as follows. Basic notations and 
definitions, used throughout the paper, are provided in Sec- 
tion nil The problem of index coding with and without error- 
correction is introduced in Section |III] Some basic results 
are presented in that section. The Q;-bound and the K-bound 
are derived in Section |IV] The Singleton bound is presented 
in Section |V] Random codes are discussed in Section |VI] 
Syndrome decoding is studied in Section IVIII A notion of 
static error-correcting index codes is presented in Section lVIIII 
Several bounds on the length of such codes are derived, and 
connections to resilient function are shown in that section. 
Finally, the results are summarized in Section |IX] and some 
open questions are proposed therein. 

II. Preliminaries 

In this section we introduce some useful notation. Here Fg 
is the finite field of q elements, where q is a power of prime. 
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and F* is the set of all nonzero elements of Fg . 

Let [n] = {1,2, . . . ,n}. For the vectors u = 
(mi,M2, ■••,"«) e F^ and v = {vi,V2, . ■ ■ ,Vn) e F^', the 
(Hamming) distance between u and v is defined to be the 
number of coordinates where u and v differ, namely, 

d(u,'i;) = \{i e [n] : u, ^ Vi}\ . 

If u G F^' and M C F^ is a set of vectors (or a vector 
subspace), then the last definition can be extended to 

A(u. M) ~ min d(it, v) . 

veM 

The support of a vector u G F^' is defined to be the set 
supp(m) — {i G [n] : Ui ^ 0}. The (Hamming) weight of 
a vector it, denoted wt(tt), is defined to be |supp(u.)|, the 
number of nonzero coordinates of u. Suppose EC [n]. We 
write u <] E whenever supp(M) C E. 

A /c-dimensional subspace of F^ is called a linear 
[n, k, d]q code over F^ if the minimum distance of 

d('rf) = min d(u,v) , 

uG'g, ve'^, u^v 

is equal to d. Sometimes we may use the notation [n, k]q for 
the sake of simplicity. The vectors in 'rf are called codewords. 
It is easy to see that the minimum weight of a nonzero 
codeword in a linear code is equal to its minimum distance 
d('i'). A generator matrix G of an [n, k]q code "if is a fc x 7i 
matrix whose rows are linearly independent codewords of 
Then — {yG : y G F^}. The parity-check matrix of '^i' is an 
{n — k)xn matrix H over F^ such that c G Hc^ = 0"^. 
Given q, k, and d, let Nq[k, d] denote the length of the shortest 
linear code over Fg which has dimension k and minimum 
distance d. 

We use e, = (0, . . . , 0, 1, 0, . . . , 0) G F'^' to denote the 

unit vector, which has a one at the ith position, and zeros 
elsewhere. For a vector y — {yi,y2, ■ ■ ■ ,yn) and a subset 
B = {ii,i2, ■ . ■ , ib} of [n], where ii < 12 <■■■< ib, let y^ 
denote the vector {yi^ ,yi^, . . . ,y,^). 

For an n X matrix L, let Li denote its ith row. For a 
set -EC [n], let Le denote the \E\x N matrix obtained from 
L by deleting all the rows of L which are not indexed by 
the elements of E. For a set of vectors Af , we use notation 
span(Af) to denote the linear space spanned by the vectors 
in M. We also use notation colspan(X) for the linear space 
spanned by the columns of the matrix L. 

Let = (V, 5) be a graph with a vertex set V and an edge 
set £. The graph is called undirected if every edge e G f , 
e = {u, v}, and u, u G V. A graph Q is directed if every edge 
e G £ is an ordered pair e = {u,v), u,v € V . A directed 
graph Q is called symmetric if 

(m, w) G f ^ (w, u) G £ . 

There is a natural correspondence between undirected graph 
Q = (V, £) and directed symmetric graph Q' = (V, £') defined 
as 

£^{{u,v} : {u,v)e£'} . (2) 

Let Q be an undirected graph. A subset of vertices 5 C V is 
called an independent set if Vu, v £ S, {u, v} ^ £. The size 



of the largest independent set in is called the independence 
number of Q, and is denoted by a{Q). The graph Q = (V, £) 
is called the complement of Q — (V, £) if 

£ = {{u, v} : u G V, w G V, {u, v} ^ £} . 

A coloring of Q using x colors is a function t/j : V — [x], 
such that 

Ve = {u,v} G £ : ip{u) ^ . 

The chromatic number of Q is the smallest number x such that 
there exists a coloring of Q using x colors, and it is denoted 
by x{G)- By using the correspondence dU, the definitions 
of independence number, graph complement and chromatic 
number are trivially extended to directed symmetric graphs. 

III. Index Coding and Error Correction 

A. Index Coding with Side Information 

Index Coding with Side Information problem considers 
the following communications scenario. There is a unique 
sender (or source) 5, who has a vector of messages x = 
{xi,X2, ■ ■ ■ , Xn) in his possession. There are also m receivers 
Ri, R2, ■ ■ ■ , Rm, receiving information from S via a broadcast 
channel. For each i G [m], Ri has side information, i.e. Ri 
owns a subset of messages {xj}j^Xi, where Xi C [n]. Each 
Ri, i G [m], is interested in receiving the message x^j^) (we 
say that Ri requires Xf^i)), where the mapping / : [m] — > [n] 
satisfies /(i) ^ Xi for all i G [m]. Hereafter, we use the 
notation X = {Xi, X2, . . . , X„i). An instance of the ICSI 
problem is given by a quadruple (m, n, X, /). It can also be 
conveniently described by a directed hypergraph llTjI . 

Definition 3.1: Let (m, n, X, f) be an instance of the ICSI 
problem. The corresponding side information (directed) hyper- 
graph H — %{m,n, X , f) is defined by the vertex set V = [n] 
and the edge set £-u, where 

£n = {(/(«), ^0 : *e [n]} . 

We often refer to (m, n, X, f) as an instance of the ICSI 
problem described by the hypergraph Ti. 

Each side information hypergraph H — {V,£-h) can be 
associated with the directed graph Q-u = (V, £) in the 
following way. For each directed edge {f{i),Xi) G £-h there 
will be \Xi\ directed edges {f{i),v) G £, for v G Xi. When 
m — n and f{i) = i for all i G [m], the graph Q-u is, in fact, 
the side information graph, defined in |6|. 

The goal of the ICSI problem is to design a coding scheme 
that allows S to satisfy the requests of all receivers Ri in the 
least number of transmissions. More formally, we have the 
following definition. 

Definition 3.2: An index code over Fg for an instance of 
the ICSI problem described hy H = H{m,n, X , f) (or just 
an H-IC over Fg), is an encoding function 

£ : F';^F^, 
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such that for each receiver Ri, i ^ [m], there exists a decoding 
function 

satisfying 

Va^GF^ : T),{e{x),xx,) = Xf^i-, . 

Sometimes we refer to such £ as a non-error-correcting index 
code. The parameter N is called the length of the index code. 
In the scheme corresponding to this code, S broadcasts a 
vector (£(a;) of length N over F^. 

Definition 3.3: A linear index code is an index code, for 
which the encoding function (£ is a linear transformation over 
¥q. Such a code can be described as 

Va; e F^ : £(a;) = xL , 

where L is an n x matrix over ¥q. The matrix L is called 
the matrix corresponding to the index code (£. The code £ is 
also referred to as the linear index code based on L. 

Hereafter, we assume that X = (Ai)jg[„j] is known to S. 
Moreover, we also assume that the code £ is known to each 
receiver Ri, i E [m]. In practice this can be achieved by a 
preliminary communication session, when the knowledge of 
the sets for i £ [ni] and of the code 2; are disseminated 
between the participants of the scheme. 

Definition 3.4: Suppose H — H{m,n,X,f) corresponds 
to an instance of the ICSI problem. Then the min-rank of H 
over ¥q is defined as 

Kq{n) = min{rankF^({i;, + ej:(,)}ig[„]) : 

i;, € F^ , -y, <] X,} . 

Observe that Kq{'H) generalizes the min-rank over F^ of the 
side information graph, which was defined in [6]. More specif- 
ically, when m = n and f{i) = i for all i G [to], Gn becomes 
the side information graph, and Hq{Ti.) = mm-m'nkq{Qu) ■ 
The min-rank of an undirected graph was first introduced by 
Haemers lfT6l to bound the Shannon capacity of a graph, and 
was later proved in [6|, [7| to be the smallest number of 
transmissions in a linear index code. 

The following lemma was implicitly formulated in [6] for 
the case where m = n, q = 2, f(i) = i for all i G [n], and 
generalized to its current form in IITtI . 

Lemma 3.5: Consider an instance of the ICSI problem 
described by 7^ = 7^ (to, n, X, /) . 

1) The matrix L corresponds to a linear H-IC over ¥q if 
and only if for each i G [to] there exists Vi G ¥q such 
that 

• Vt<lXt ; 

» Vi + ef[i) G colspan(i) . 

2) The smallest possible length of a linear "H-IC over ¥q is 



B. Error-Correcting Index Code with Side Information 

Due to noise, the symbols received by Ri, i G [to], may be 
subject to errors. Consider an ICSI instance (to, n, X, /), and 
assume that S broadcasts a vector (B{x) G F^. Let G F^ 
be the error affecting the information received by Ri, i <E [m]. 
Then Ri actually receives the vector 

= €ix) + e,; G Ff , 

instead of €{x). The following definition is a generalization 
of Definition 13.21 

Definition 3.6: Consider an instance of the ICSI problem 
described by 7^ = T-L{'m,n, X, f). A S-error-correcting index 
code {{S,TL)-ECIC) over ¥q for this instance is an encoding 
function 

such that for each receiver Ri, i £ [to], there exists a decoding 
function 

: F^ xFI-^'I -^¥q , 

satisfying 

Va;,e, G F;^\ wt(e,) < (5 : T)i{€{x) + e^,xx,) ^ xjf^i) . 

The definitions of the length, of a linear index code, and of the 
matrix corresponding to an index code are naturally extended 
to an error-correcting index code. Note that if £ is an TL-IC, 
then it is a (0,H)-ECIC, and vice versa. 

Definition 3.7: An optimal linear ((5, 'H)-ECIC over F^ is 
a linear {S,'H)-EC1C over F^ of the smallest possible length 

Consider an instance of the ICSI problem described hy H = 
H{m, n, X, /). We define the set of vectors 

{z e¥q : 3i e [to] such that = and ^ O} . 
For all i G [to], we also define 

y.^[n]\[{fiz)}UX?). 

Then the collection of supports of all vectors in 2{q, H) is 
given by 

J{n)= U {{f{i)}UY, : Y,cy,y (3) 

iG[rn] 

The necessary and sufficient condition for a matrix L to be 
the matrix corresponding to some {6, 'H)-ECIC is given in the 
following lemma. 

Lemma 3.8: The matrix L corresponds to a {S,'H)-ECIC 
over ¥q if and only if 

wt (zL) > 2(5 + 1 for all z G V.) . (4) 

Equivalently, L corresponds to a {S,'H)-ECIC over ¥q if and 
only if 
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for all K e J{H) and for all choices of <E¥*,ie K. 
Proof: For each a; G F^, we define 

B{x, 6) = {ye¥^ : y = xL + e,ee¥^, wt(e) < 6} , 

the set of all vectors resulting from at most 5 errors in the 
transmitted vector associated with the information vector x. 
Then the receiver Ri can recover a;/(i) correctly if and only 
if 

B{x,d)nB{x',5) = 0, 
for every pair x,x' e F^* satisfying: 

XX, = x'x^ and a;/(i) ^ x'^^f^ . 

(Observe that Ri is interested only in the bit a;/(i), not in the 
whole vector x.) 

Therefore, L corresponds to a {5, 'H)-ECIC if and only if 
the following condition is satisfied: for all i G [m] and for all 
x,x' e F^' such that xxi ~ x'^, and a;j(j') ^ ^/(i)' holds 

Ve,e' G Ff , wt(e) s: S, wt(e') ^ S : 

xL + ey^ x'L + e' . (5) 

Denote z — x' ~ x. Then, the condition in Q can be 
reformulated as follows: for all i G [n] and for all 2; G F^ 
such that = and zj(i) 7^ 0, it holds 

Ve, e' G F^, wt(e) S, wt(e') S : zL ^ e - e' . (6) 
The equivalent condition is that for all z G I{q, Ti), 

wt{zL) ^ 2(5 + 1 . 
Since for z G X{q, TL) we have 

zL= ^ Ziii, 

iG5upp(z) 

the condition Q can be restated as 

wt ^ ZiLi j > 2(5 + 1, 

for all K G J{T-L) and for all choices of nonzero Zi G Fg, 
i G isT. ■ 



Corollary 3.10: The matrix L corresponds to an H-IC over 
■^q if and only if 



The next corollary follows from Lemma 13.81 in a straight- 
forward manner It is not hard to see that the conditions 
stated in Lemma 13.81 and in the corollary below are, in fact, 
equivalent. 

Corollary 3.9: For all i G [m], let 

M, =span({i, : j £ y^}) . 

Then, the matrix L corresponds to a {5, 7^)-ECIC over F, if 
and only if 



Vi G [m] : d(i/(,),M,) ^26 + 1. 



(7) 



The next corollary also follows directly from Lemma l3?8] bv 
considering an error-free setup, i.e. (5 = 0. It is easy to verify 
that the conditions stated in this corollary and in Lemma 
are equivalent, as expected. 



wt 



\ieK ) 



for all K G J^U) and for all choices of Zi G F*, i G K, or, 
equivalently, 

Vi G [m] : ^ span({ij}jgj;J . 

Example 3.11: Let (7 = 2, to = n = 3, and /(i) = i for 
i G [3]. Suppose Xx = {2,3}, X-2, = {1,3}, and X-i = {1,2}. 
Let 

/I 1 1 o^ 
i = 1 1 1 
Vi 1 1, 

Note that L generates a [4,3, 1]2 code, which has minimum 
distance one. However, the index code based on L can still 
correct one error Indeed, let H. = H{3, 3, X, /), we have 

1(2, -H) = {100,010,001}. 

Since each row of L has weight at least three, it follows 
that wt{zL) > 3 for all z G I{2,n). By Lemma [l8l L 
corresponds to a (1,H)-ECIC over F2. 

In fact, for this instance, even a simpler index code of length 
three, based on 

/I 1 1^ 
L' = 1 1 1 

yi 1 i> 

is a (1,7^)-ECIC over F2. 

Example 3.12: Assume that m = n and f{i) = i for all 
i G [m]. Furthermore, suppose that Xi = for all i G [to] 
(i.e. there is no side information available to the receivers). 
Let n ^ ■H{m,n,XJ). Then, I{q,n) = F^\{0}. Hence, 
by Lemma 13.81 the n x N matrix L corresponding to a 
{6, 'H)-ECIC over F, (for some integer (5 ^ 0) is a generating 
matrix of an [N, n, ^ 2S + l]q linear code. Thus, under these 
settings, the problem of designing an optimal ECIC is reduced 
to the problem of constructing an optimal classical linear error- 
correcting code. 

Observe however, that for general X, changing the order of 
rows in L can lead to ECIC's with different error-correcting 
capabilities. Therefore, the problem of designing an optimal 
linear ECIC is essentially the problem of finding the matrix L 
corresponding to that code. However, the minimum distance 
of the code generated by the rows of L is not necessary 
a valid indicator for goodness of an ECIC. Sometimes, as 
Example 13.1 II shows, matrix L with redundant rows yields 
a good ECIC. 

IV. The a-BouND and the k-Bound 

Let (m, n, X, f) be an instance of the ICSI problem, and 
let y, be the corresponding side information hypergraph. Next, 
we introduce the following definitions for the hypergraph TL. 
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Definition 4.1: A subset H of [n] is called a generalized 
independent set in % if every nonempty subset K of H 
belongs to J(7^). 

Definition 4.2: A generalized independent set of the largest 
size in "H is called a maximum generalized independent set. 
The size of a maximum generalized independent set in Ti. is 
called the generalized independence number, and denoted by 
a(H). 

When in = n and /(i) = i for all i e [n], the generalized 
independence number of H is equal to the maximum size of 
an acyclic induced subgraph of Qfi, which was introduced 
in f6l. In particular, when Q-u is symmetric, a('H) is the 
independence number of Q-^. We prove the latter statement 
in the Appendix. 

Next, we present a lower bound on the length of a {d,TL)- 
ECIC. We call this bound a-bound. 

Theorem 4.3 (a-bound): The length of an optimal linear 
{5, n)-EClC over satisfies 

Moreover, the equality is attained if there exists an n x a{'H) 
matrix B — (bij) over Fg satisfying the following condition: 
for a\\ K e JiH) and for all choices of G F*, i e K, there 
always exists some j such that 



For every K e JiH) and for all choices of Zi e ¥*, i e K, 
we have 



^0. 



Proof: Consider an n x iV matrix L, which corresponds 
to a {5,'H)-ECIC. Let H ~ {ii,i2, ■ ■ ■ ,*a(-H)} be a maximum 
generalized independent set in "H. Then, every subset K C H 
satisfies K G J{T-L). Therefore, 



wt 



z^U >25+l 



for ?i\\K C1H,K ^ 0, and for all choices of G F*, i £ ivT. 
Hence, the a{T-L) rows of L, namely Li-^, Li^, . . . , Li^^^^y 
form a generator matrix of an [N,a{'H),2d + l]q code. 
Therefore, 

N >Ng[a{H),25 + l] . 



Next, we assume the existence of a matrix B satisfying 
the properties stated in the theorem. Let L' be a generator 
matrix of some [N' , a{'H),26 + l]q code, where N' = 
Nq[a{'H),26 + I]. We construct the n x N' mati'ix L as 
follows. For i G [n], let 

"(«) 



wt 



\ieK / \ieK j=i J 

■j=i \ieK I ^ 
> 2(5 + 1 , 

where the last transition is due to the existence of j G [q:(?^)] 
such that 

Zi bij ^ , 

and the fact that i^ 's are linearly independent nonzero code- 
words of a code of minimum distance 26 + 1. 

We conclude that the index code based on L is capable of 
correcting 6 errors. Therefore, A/'g[7{,(5] = Ng[a{'H),2S + 1]. 

■ 

Example 4.4: Let q = 2, m = n = 5, f{i) = i for all 
i G [m], and S — 2. Assume 

^-1== {2,3,4}, ^-2 = {3,4,5}, ;i:'3 = {4, 5, 1} , 
^"4 = {5,1,2}, ^-5 = {1,2,3}. 

Let n = 7^(5,5, A",/). Then 

Jm ={{1},{1,5},{2},{2,1},{3}, 

{3,2},{4},{4,3},{5},{5,4}}. 

It is easy to check that a{'H) = 2. Therefore, Theorem 
implies that 

^2[H,2] > N2[2,5]^8. 

The last equality can be verified by ifTsl . 
On the other hand, take the matrix 



B 



The matrix B satisfies the property that for all K G J{T-L), 
K ^ 0, there exists j G [2] such that 

From Theorem US we have M2[H., 2] ^N2[2,^]^ 8. 

Remark 4.5: In |6|, when m — n and f{i) = i for all 
i G [n], a{T-L) was shown to be a lower bound on the length 
of a (non-error-correcting) linear index code. However, the a- 
bound in Theorem 14. 31 does not follow from the results in [6 |. 
The reason is that a concatenation of an optimal linear error- 
correcting code with an optimal non-error-correcting index 
code might fail to produce an optimal linear ECIC. This is 
illustrated later in Example 14.81 
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The following proposition is based on the fact that concate- 
nation of a (5-error-correcting code with an optimal (non-error- 
correcting) n-lC yields a (^,H)-ECIC. 

Proposition 4.6 (n-bound): The length of an optimal 
{5, H)-ECIC over satisfies 

J\fg[n,S]<Ng[Kg{n),2S + l] . 

Proof: Let G, which is an n x Kq{H) matrix, correspond 
to an optimal H-IC over ¥q. Denote 

y = xGe F^'(^^ 

Let M be a generator matrix of an optimal [N, Kq{H), 25+l]q 
code where 

N^Nq[Kq{n),26+l]. 

Consider a scheme where S broadcasts the vector yM G F^. 
If less than 6 errors occur, then each receiver Ri is able to 
recover y by using Hence each Ri is able to recover Xfi^^y 
Therefore, for the index code based on L, 

L = CM , 

each receiver Ri is capable to recover if the number of 
errors is less or equal to 5. The length of the corresponding 
ECIC is iV = Nq[Kq(n), 26+1]. Therefore, 

f^q[n,S]<Nq[Kqin),2S+l] . 



By combining the results in Theorem 14.31 and in Proposi- 
tion 14.61 we obtain the following corollary. 

Corollary 4.7: The length of an optimal linear {S, H)-ECIC 
over ¥q satisfies 

Nq[a{H),26 + l]<Uq[n,S] < Nq[Kq{n),26 + 1] . 

It is shown in the example below that the inequalities in 
Corollary 14.71 can be strict. In particular, it follows that mere 
application of an error-correcting code on top of an index code 
may fail to provide us with an optimal linear ECIC. This fact 
motivates the study of ECIC's in Sections UlIUVIII 

Example 4.8: Let q — 2, m = n = 5, 6 = 2, and f{i) = i 
for all i e [m]. Assume 

A'i = {2,5}, ^2 = {1,3}, A'3 = {2,4}, 
-¥4 = {3,5}, A'5 = {1,4}. 

Let n = 7^(5, 5, X, /). Then we have 

^('«) = {{1},{1,3},{1,4},{1,3,4}, 

{2},{2,4},{2,5},{2,4,5}, 
{3},{1,3},{3,5},{1,3,5}, 
{4},{1,4},{2,4},{1,2,4}, 

{5}, {2, 5}, {3, 5}, {2, 3, 5}}. 

The side information graph Q-^ of this instance is a pentagon. 
It is easy to verify that a{T-L) = a{Q-u) = 2. It follows from 



Theorem 9 in [T| that K2{'H) = min-rank2(^-H) = 3. Thus, 
from 11181 we have 

iV2[2,5]==8 and iV2[3,5] = 10. 

Due to Corollary 14.71 we have 

8 < A/'2[H,2] < 10. 

Using a computer search, we obtain that A/2['H,2] = 9, and 
the corresponding optimal scheme is based on 
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It is technical to verify that for all K G J{T-L), 

wt ( ^ j > 5 . 

Therefore by Lemma ISTSl for the index code based on L, each 
receiver Ri is able to recover Xi, if the number of errors is 
less than or equal to 2. Observe that the length of the ECIC 
corresponding to L lies strictly between the a-bound and the 
K-bound. 

When the graph Q is undirected (or symmetric), the follow- 
ing theorem holds (see, for instance, |16|). 

Theorem 4.9: Let x{Q) denote the chromatic number of the 
complement of the graph Q. Then, 

oi{Q) < min-rank,(f/) < x{G) ■ 

When m = n and f{i) = i for all i G [m], we have that 
a{'H) — a{Q-u) and Kq{TL) — min-rankg(C/^). Moreover, if 
the graph Q-^ is symmetric and satisfies a{Q-H) — xiOn), then 
from Corollary 14.71 we have 

Mq[n,6] = Nq[a{H),25 + l] = Nq[Kq{H),26 + 1] , 

for all q, and the corresponding bounds in Corollary 14.71 are 
tight. 

Definition 4.10: An undirected (or symmetric) graph Q is 
called perfect if for every induced subgraph Q' of Q, a{Q') — 

x{Q')- 

Perfect graphs include families of graphs such as trees, 
bipartite graphs, interval graphs, and chordal graphs. If m — n, 
f{i) = i for all i G [to], and Gh is perfect, then the bounds in 
Corollary 14.71 are tight. For the full characterization of perfect 
graphs, the reader can refer to [lT9l . 

V. The Singleton Bound 

The following bound is analogous to Singleton bound for 
classical linear error-correcting codes. 

Theorem 5.1 (Singleton bound): The length of an optimal 
linear ((5, 7^)-ECIC over Fg satisfies 

Mq[n,5] > Kq{n) + 2S . 
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Proof: Let L be the n x J\fq\H^5\ matrix corresponding 
to some optimal (5, H)-ECIC. Let L' be the matrix obtained 
by deleting any 25 columns from L. 
By Lemma 13.81 L satisfies 



wt 2^ ZiLi > 25 + 1 , 
Vie A' / 

for all K e Jin) and all choices of z, e ¥*, i e K. We 
deduce that the rows of L' also satisfy 



> 1 . 




By Corollary 13. 101 L' corresponds to a linear T-L-IC Therefore, 
by Lemma [331 part 2, L' has at least Kq{'H) columns. We 
deduce that 

which concludes the proof. ■ 

The following corollary from Proposition 14.61 and Theo- 
rem 15.11 demonstrates that, for sufficiently large alphabets, a 
concatenation of a classical MDS error-correcting code with 
an optimal (non-error-correcting) index code yields an optimal 
ECIC. However, as it was illustrated in Example 14.81 this does 
not hold for the index coding schemes over small alphabets. 

Corollary 5.2 (MDS error-correcting index code): For q > 

jVg[H.,d]= Kg{n)+26 . (8) 

Proof: From Theorem 15.11 we have 

J^q[n,d] > Kg{n) + 2S . 

On the other hand, from Proposition 14.61 

AA,[H,(5] <Ng[Kg{H),2S + l] = Kg{H) + 26 , 

for q > Kq{'H) + 2(5 — 1 (by taking doubly-extended Reed- 
Solomon (RS) codes). Therefore, for these q, dHJ holds. ■ 

Remark 5.3: Let q = 2, m = n = 2i + 1 (i > 2), and 
f{i) = i for all i € [m]. Let Xi = {2,n} and X„ = {1, n-l}. 
For2 < z < n, let X, = + Let H = T-L{n,n, X , f). 

Then Q-^ is the (symmetric) odd cycle of length n. Therefore, 
a{%) = a(Q-H) = £■ From Q, K2(H) = min-rank2(tJ-H) = 
£ + 1. From a-bound, 

^f2[H,5] >N2[l,2S+l] . 
By contrast, from Theorem 15.11 

^2[n,6] >{£+l) + 2S . 
As there are no nontrivial binary MDS codes, we have 

N2[i,2S+l] >£ + 25+l, 

for all choices of 5 > 0. Therefore, for these choices, the 
a-bound is at least as good as the Singleton bound. 



VI. Random codes 

In this section we prove an inexplicit upper bound on 
the optimal length of the ECIC's. The proof is based on 
constructing a random ECIC and analyzing its parameters. 

Theorem 6.1: Let Ti, = H{m, n, X, /) describe an instance 
of the ICSI problem. Then there exists a {S, H)-ECIC over 
of length N if 



where 



25 



e=Q 



Vq{N,2S) 



K,(7V,2<5)=^(^)(g-l) 



(9) 



is the volume of the g-ary sphere in F^. 

Proof We construct a random n x N matrix L over F^, 
row by row. Each row is selected independently of other rows, 
uniformly over F^. Define vector spaces 

M, = span({i, : j e y,}) 
for all i G [m]. We also define the following events: 
VieH : Event Si = {d(i/(i), M^) < 2(5 + l} , 
and 

Event Epaii = 

{L does not correspond to a {S,H)-ECIC} . 

The event Ei represents the situation when the receiver Ri 
cannot recover Xf(^iy Then, by Corollarv 13.91 the event Epaii 
is equivalent to Uie[m] Therefore, 

\ie[m] / ie[m] 



For a particular event Ei, i ^ [m], 

q\y^\Vq{N, 26) 



(11) 



There exists a matrix L that corresponds to a ((5, H)-ECIC 
if Pr {Epaii) < 1. It is enough to require that the right-hand 
side of ( fTOb is smaller than 1. By plugging in the expression 
in (fTTT i. we obtain a sufficient condition on the existence of a 
((5, H)-ECIC over F,: 

Va{N.28) ^ i^^i 



■iG[m] 



< 1 . 



Remark 6.2: The bound in Theorem 16.11 does not take 
into account the structure of the sets Xi's, other than their 
cardinalities. Therefore, this bound generally is weaker than 
the K-bound. On the other hand, for a particular instance of 
the ICSI problem, it is easier to compute this bound, while 
calculating the K-bound in general is an NP-hard problem. 
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Remark 6.3: The bound in Theorem 16.11 implies a bound 
on Kq{'H), which is tight for some "H. Indeed, fix S — 0. The 
bound implies that there exists a linear index code of length 
N whenever 

^ < . (12) 

Let TO = n = 2£ + 1 (£ > 2), and /(i) = i for all i e [n\. Let 
A"! = [n]\{l,2,n} and = [n\\{l,n- l,n}. For 2 < i < 
n-1, let X, = [n]\{i + 1}. Let n = n{n, n, X, f) 

be the corresponding side information hypergraph. Then Qu 
is the complement of the (symmetric directed) odd cycle of 
length n. We have lA-^l = 2£ - 2 for all i e [n]. Then ^ 
becomes 

N > 2 + log^(2£+ 1) . 

If (7 > 2£+ 1 then we obtain N ^ ?>. Observe that in this case 
Kq{T-L) ~ mm-Tankq{Q^) = 3 (see fT3^, Claim A.l]), and thus 
the bound is tight. 

VII. Syndrome decoding 

Consider the {6, H)-ECIC based on a matrix L. Suppose 
that the receiver Ri, i ^ [m], receives the vector 

yi = xL + e, , (13) 

where xL is the codeword transmitted by S, and is the 
error pattern affecting this codeword. 

In the classical coding theory, the transmitted vector c, the 
received vector y, and the error pattern e are related by y = 
c + e. Therefore, if y is known to the receiver, then there is 
a one-to-one correspondence between the values of unknown 
vectors c and e. For index coding, however, this is no longer 
the case. The following theorem shows that, in order to recover 
the message from using (T3[ . it is sufficient to find 
just one vector from a set of possible error patterns. This set 
is defined as follows: 

A(ei) = {e; + z : z & span({Xj}jej;J} . 

We henceforth refer to the set Ci{ei) as the set of relevant 
error patterns. 

Lemma 7.1: Assume that the receiver Ri receives y^. 

1) If Ri knows the message Xf(i-^ then it is able to determine 
the set Liiei). 

2) If Ri knows some vector e e Ci(e..i) then it is able to 
determine Xf(i^. 

Proof: 

1) From ( fT3] l, we have 

Vi = ^f{i)Lf(2) + Xx.Lxi + xy^Ly. + . (14) 

If Ri knows Xf{i), then it is also able to determine 

e,: + xy^Ly^ ^ Ui" ^f(i)Lf{i) - xx,Lxi e A(ei) . 

Since Ri has a knowledge of L, it is also able to 
determine the whole Ci{ei). 

2) Suppose that Ri knows a vector 

e = + ^ ZjLj e Ci{ei) , 



for some z — {z-j)y. G '. We show that Ri is able 
then to determine xj^iy Indeed, we re-write (fl4] i as 

Vi = Xf('i)Lf{^) + xx.Lx, + {xy, - z)Ly^ + e . (15) 
The receiver Ri can find some solution of the equation 

Vi = ^f{'i.)^f(i) + xx.Lx, + Xy.Ly^ + e , (16) 

with respect to the unknowns and xy.. Observe 

that ( fTSI ) has at least one solution due to ( fTSl ). 
From ( fTSl l and ( fT6] l, we deduce that 

= - Xf(^^)Lf(i) + {Xy^ - Xy^ + z)Ly^ . 

This equality implies that = (otherwise, by 

Corollary 13.91 the sum in the right-hand side will have 
nonzero weight). Hence, Ri is able to determine as 
claimed. 



We now describe a syndrome decoding algorithm for linear 
error-correcting index codes. From ([14], we have 

Vt - xx^Lx, -e,e span({i^(j)} U {L^}jey,) . 

Let "ifi — span({Ly(i)} U {Lj}j^y.), and let if*-*-' be a parity 
check matrix of "^i. We obtain that 

H^^)ef ^H^'\y, -xxM^ ■ (17) 
Let /3j be a column vector defined by 

f3^ = H^'\y^-xx,Lxj'^ . (18) 

Observe that each Ri is capable of determining /3j. Then we 
can re-write (fTT^ as 

This leads us to the formulation of the following decoding 
procedure for Ri. 



• Input: j/j, xxi, L. 

m Step 1: Compute the syndrome 

l3^=H^'\y, -xx,Lx,Y . 

« Step 2: Find the lowest Hamming weight solution e of 
the system 

ffWe'^ = /3, . (19) 

• Step 3: Given that xxt — xx^, solve the system for 

y,=xL + e. (20) 

• Output: Xf(^iy 



Fig. 2: Syndrome decoding procedure. 

Remark 7.2: Gaussian elimination can be used to solve ( |20l ) 
for Xf(iy However, since L also corresponds to an H-IC, there 
is more efficient way to do so. From Lemma |331 there exists 
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a vector Vi <l satisfying Vi + ej(j-) G colspan(L). Hence 
Vi + e.f{i) = uL^ for some u G F^. Therefore 

= xLnF — xvj 

With the knowledge of L and Xx^, Ri can determine u and 
xvf. Therefore, it can also determine Xj^^iy Note that (|20] i 
may have more than one solution x with i;;^'; ^ x^i- However, 
as shown in the next theorem, if at most S errors occur in y,, 
then it always holds that = 

Theorem 7.3: Let = xL + be the vector received 
by Ri, and let wt(ei) ^ 6. Assume that the procedure in 
Figure |2] is appUed to {y^,xxi, L). Then, its output satisfies 

%W = ^fi^)■ 

Proof: By Lemma ItTI it is sufficient to prove that e S 
Li{ei). Indeed, since 

we have 

Hence, e — e ^^i, and therefore, 

e - = + ^ ZjLj, (21) 

jeyi 

for some G and Zj e F^, j E 

Since is a solution of (fT9] l. and wt(ei) < 5, we deduce 
that wt(e) < 6 as well. Hence, 

wt Zf(^i)Lf(^i-) + ^ ZjLj = wt (e - Cj) < 26 . 
\ ^ey. J 

Therefore, by Corollary 13.9] — 0. Hence, e e £^(6^), as 
desired, and therefore a;j(i) = a;^(i). ■ 

Remark 7.4: We anticipate Step 2 in Figure |2] to be com- 
putationally hard. Indeed, the problem of finding e over F2 of 
the lowest weight satisfying 

ijWe^ = /3, , (22) 

for a given binary vector /3, is at least as hard as a decision 
problem COSET WEIGHTS that was shown in |l20l to be NP- 
complete. 

VIII. Static Codes and Related Problems 

A. Static Error-Correcting Index Codes 

In the previous sections we focused on linear 5-error- 
correcting index codes for a particular instance of the ICSI 
problem. When some of the parameters rn, n, X, and / are 
variable or not known, it is very likely that an error-correcting 
index code for the instance with particular values of these 
parameters can not be used for the instances with different 
values of some of these parameters. Therefore, it is interesting 
to design an error-correcting index code which will be suitable 
for a family of instances of the ICSI problem. 



Definition 8.1: Let F — {(m, n, X, /)} be a set of instances 
for an ICSI problem. A (5-error-correcting index code over 
is said to be static under the set F if it is a (5-eiTor-coiTecting 
(rn, n, X, f)-lC over F^ for all instances (to, n, X, /) G F. 

Recall that an instance (to, n, X, f) can be described by 
the side information hypergraph H{m,n, X , f). For a set F 
of instances (to, n, X, /), let 

5(r)= U jmm,n,XJ)), (23) 

(rn,n.XJ)er 

where J{%{m^ n, X, /)) is defined as in (O. We also define 

n(F) = max{n : (to, n, X, f) G F}. 

Lemma 8.2: The 71(F) x N matrix L corresponds to a 5- 
error-correcting index code which is static under F if and only 
if 

wt j j > 2<5+l, 

for all K G 5(F) and for all choices of Zi G F*, z G K. 

Proof: The proof follows from Definition 18.11 and 
Lemma 13.81 ■ 

Please notice that when L is used for an instance 
{m,n,X,f) G F with n < n(F), then the last n{T) - n 
rows of L are simply discarded. 

One particular family of interest is F(n,p), the family that 
contains all instances where each receiver owns at least n — p 
messages as its side information. More formally, 

^in,p) ^ {{m,n',X,f) : n'<n 

and Vi G [to], \Xi\>n-p]. 

A (5-error-correcting index code which is static under F(rt, p) 
will provide successful communication between the sender and 
the receivers under the presence of at most 5 errors, despite 
a possible change of the collection of the side information 
sets X, a change of the set of receivers, and a change of the 
demand function, as long as each receiver still possesses at 
least n ~ p messages. 

In the rest of this section, we assume that N >l,n> p>l 
and (5 > 0. 

Definition 8.3: An n x N matrix L is said to satisfy the 
(p, i5)-Property if any nontrivial linear combination of at most 
p rows of L has weight at least 26+1. 

Proposition 8.4: The n x N matrix L corresponds to a 
(5-eiTor-coiTecting linear index code, which is static under 
F(7i, p), if and only if L satisfies the {p, (5)-Property. 

Proof: Let i be an n x matrix that satisfies the (p, 6)- 
Property. We show that this is equivalent to the condition that 
L corresponds to a (5-eiTor-coiTecting linear index code, which 
is static under F(?i, p). By Lemma [8^ it suffices to show that 
Z{T{n,p)) is the collection of all nonempty subsets of [n], 
whose cardinalities are not greater than p. 
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Consider an instance {m,n' , X , f) £ T{n,p). For all i G 
[m], we have \X,\ > n~ p and yi ^ [n']\{f{i) U Xi), and 
thus we deduce that 

\yi\<n' -l-{n- p)<n' ~l-{n' - p)= p- 1. 

Hence by (O, the cardinality of each set in J'{H{m, n' , X, /)) 
is at most 

l + {p~l)^p. 

Therefore, due to (l2Jt . every set in 3(r(n, p)) has at most p 
elements. 

It remains to show that every nonempty subset of [n] whose 
cardinality is at most p belongs to 5(r(n,p)). Consider an 
arbitrary p'-subset K = {ii,i2, ■ ■ ■ ,ip'} of [n], with I < p' < 
p. Consider an instance (m = 1, n, X, /) G T{n, p) with Xi — 
[n]\K and /(I) = ii. Since 

yi = K\{h}, 

we have 

= {n} U 3^1 e J{H{m, n, X, /)) C 0(r(n, p)). 
The proof follows. ■ 

B. Application: Weakly Resilient Functions 

In this section we introduce the notion of weakly resilient 
functions. Hereafter, we restrict the discussion to the binary 
alphabet. 

The concept of binary resilient functions was first intro- 
duced by Chor et. al. in fT\\ and independently by Bennet et. 
al. in 1221 . 

Definition 8.5: A function / : ^ F^' is called t- 

resilient if / satisfies the following property: when t arbitrary 
inputs of / are fixed and the remaining N — t inputs run 
through all the 2^~*-tuples exactly once, the value of / runs 
through every possible output n-tuple an equal number of 
times. Moreover, if / is a linear transformation then it is called 
a linear t-resilient function. We refer to the parameter t as the 
resiliency of /. 

The applications of resilient functions can be found in 
fault-tolerant distributed computing, quantum cryptographic 
key distribution 1,2 11 . privacy amplification [22| and random 
sequence generation for stream ciphers ll23l . Connections 
between linear error-correcting codes and resilient functions 
were established in [21 J. 

Theorem 8.6 ( /|27|/).- Let L be an n x binary matrix. 
Then L is a generator matrix of a linear error-correcting code 
with minimum distance d = t + 1 if and only if f{z) = Lz'^ 
is <-resilient. 

Remark 8. 7: Vectorial boolean functions with certain prop- 
erties are useful for design of stream ciphers. These properties 
include high resiliency and high nonlinearity (see, for instance, 
1231). However, linear resilient functions are still particularly 
interesting, since they can be transformed into highly nonlinear 
resilient functions with the same parameters. This can be 



achieved by a composition of the linear function with a highly 
nonlinear permutation (see ll24l . ll25l for more details). 

Below we introduce a definition of a p-weakly i-resilient 
function, which is a weaker version of a t-resilient function. 

Definition 8.8: A function / : F^ ^ F'2^ is called p- 
weakly t-resilient if / satisfies the property that every set of 
p coordinates in the image of / runs through every possible 
output /9-tuple an equal number of times, when t arbitrary 
inputs of / are fixed and the remaining N — t inputs run 
through all the 2^^*-tuples exactly once. 

Remark 8.9: A p-weakly t-resilient function / : F^ — > F2 
can be viewed as a collection of different t-resilient 
functions F^ — s- Fj, each such function is obtained by taking 
some p coordinates in the image of /. Similarly to |,21J. 
consider a scenario, in which two parties are sharing a secret 
key, which consists of N randomly selected bits. Suppose that 
at some moment t out of the N bits of the key are leaked to 
an adversary. By applying a t-resilient function to the current 
A^-bit key, two parties are able to obtain a completely new and 
secret key of n bits, without requiring any communication or 
randomness generation. However, if the parties use various 
parts of the key for various purposes, they may only require 
one of the p-bit secret keys (instead of the larger n-bit key). 
In that case a p-weakly t-resilient function can be used. By 
applying a p-weakly t-resilient function to the current A^- 
bit key, the parties obtain a set of (p different p-bit keys, 
each key is new and secret (however these keys might not be 
independent of each other). 

Theorem 8.10: Let L be an n x binary matrix. Then L 
satisfies the (p, (5)-Property if and only if the function / : 
F^ F2 defined by f{z) = Lz'^ is p-weakly 2(5-resilient. 
Proof: 

1) Suppose that L satisfies the (p, (5) -Property. Take any 
p-subset K C [n] . By Definition 18.31 the p x A^ sub- 
matrix Lk of i is a generating matrix of the error- 
correcting code with the minimum distance ^ 2(5+1. 
By Theorem [121 the function : F^ ^ F^ defined by 
fxi^) — Lkz^ is 2(5-resilient. Since K is an arbitrary 
p-subset of [n], the function / is p-weakly 2(5-resilient. 

2) Conversely, assume that the function / is p-weakly 26- 
resilient. Take any subset K C [n], \K\ = p. Then the 
function f k ■ ^2 K defined by f^iz) = Lkz'^ 
is 2(5-resilient. Therefore, by Theorem 18. 6J Lk is a 
generating matrix of a linear code with minimum distance 
2(5+1. Since K is an arbitrary p-subset of [n], by 
Proposition 18.41 1/ satisfies the (p, (5)-Property. 



C. Bounds and Constructions 

In this section we study the problem of constructing a matrix 
L satisfying the (p, 5)-Property. Such L with the minimal 
possible number of columns is called optimal. First, observe 
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that from Proposition 18.41 we have 

i=l ^ ^ 

is the set of all nonempty subsets of [n] of cardinality at most 
p. Next, consider an instance {m* ,n, X* , f*) satisfying 

=a(r(n,p)) , (24) 

where H* — H{m* ,n, X* , f*) is the side information hy- 
pergraph corresponding to that instance. Such an instance 
can be constructed as follows. For each subset K = 
{ii,i2, ■ ■ ■ ,*p'} ^ [n] (1 < p' < p), we introduce a receiver 
which requests the message Xi^, and has a set {xj : j £ 
[n]\i<r } as its side information. It is straightforward to verify 
that indeed we obtain an instance {rn* ,n, X* , f*) satisfying 
(|24] |. The problem of designing an optimal matrix L satisfying 
the {p, 5)-Property then becomes equivalent to the problem of 
finding an optimal {S,'H*)-ECIC. Thus, J\fy['H*,5] is equal to 
the number of columns in an optimal matrix which satisfies 
the (p, (5) -Property. 

The corresponding a-bound and K-bound for Afq[H*,6] can 
be stated as follows. 

Theorem 8.11: Let p* be the smallest number such that a 
linear [n, n — p*, > p + l]g code exists. Then we have 

Nq[p,25 + l\ <Nq[n\5] < Ng[p*,26 + l] . 

Proof: The first inequality follows from the a-bound and 
from the fact that a{'H*) ~ p, which is due to ( |24] |. 

For the second inequality, it suffices to show that Kq{T-L*) — 
p*. By Corollary 13.101 an n x matrix L corresponds to an 
TL*-IC if and only if {Li : i G A'} is linearly independent for 
every K e J{'H*). Since J{'H*) is the set of all nonempty 
subsets of cardinality at most p, this is equivalent to saying 
that every set of at most p rows of L is linearly independent. 
This condition is equivalent to the condition that is a parity 
check matrix of a linear code with the minimum distance at 
least p+1 (TF, Chapter 1]. Therefore, a linear 'H*-IC of length 
N exists if and only if an [n, n—N, ^ /0+l]g linear code exists. 
Since p* is the smallest number such that an [n,n — p*,> 
p + l]q code exists, we conclude that Kg{T-L*) ^ p* ■ ■ 

Corollary 8.12: The length of an optimal (5-error-correcting 
linear index code over which is static under r(n, p) satisfies 

J^qis^n*] > P* + 25, 

where p* is the smallest number such that an [n, n — p*, > 
p + l\q code exists. 

Proof: This is a straightforward corollary of Theorem 15. II 
(the Singleton bound) and Theorem 18.1 II ■ 

Corollary 8.13: For q > max{n— 1, p + 2S — 1}, the length 
of an optimal (5-error-correcting linear index code over Fg 
which is static under r(n, p) is p + 25. 

Proof: For q > n — 1 there exists an [n,n — p* , p + 
l]q linear code with p* = p (for example, one can take an 
extended RS code ||26l Chapter 11]). Due to Singleton bound, 
we conclude that p* — pis the smallest value such that [n, n — 



p*, p+ l]q linear code exists. Following the lines of the proof 
of Theorem 18. 11 1 there exists a (5-error-correcting index code 
of length Nq[p, 26 + 1], which is static under T{n, p). As q > 
p + 2(5 — 1, we have 

Nq[p,2S+l]= P + 2S 

(for example, by taking an extended RS code). Due to Corol- 
lary 18.121 this static error-correcting index code is optimal. 

■ 

Remark 8.14: We observe from the proof of Theorem 18.111 
that the problem of constructing an optimal linear (non-error- 
correcting) index code, which is static under r(n,p), is, in 
fact, equivalent to the problem of constructing a parity check 
matrix of a classical linear error-correcting code. 

Example 8.15: Let n — 20, p = 10, (5 = 1 and q = 2. 
From |T8l, the smallest possible dimension of a binary linear 
code of length 20 and minimum distance 11 is 3. We obtain 
that p* = 17. We also have iV2[17,3] = 22. Theorem [STT] 
implies the existence of a one-error-correcting binary index 
code of length 22 which can be used for any instance of IC 
problem, in which each receiver owns at least 10 out of (at 
most) 20 messages, as side information. It also implies that 
the length of any such static error-correcting index code is at 
least A^2[10,3] = 14. CoroUarv 18. 121 provides a better lower 
bound on the minimum length, which is 17 + 2 = 19. 

Example 8.16: Below we show that with the same number 
of inputs N and outputs n, a weakly resilient function may 
have strictly higher resiliency t. From Example 18.151 there 
exists a linear vectorial Boolean function / : (F2)^^ (F2)^° 
which is 10-weakly 2-resilient. According to [18], an optimal 
linear [22, 20]2 code has minimum distance d — 2. Hence, due 
to Theorem 18.61 the resiliency of any linear vectorial Boolean 
function g : (F2)^^ (F2)^° cannot exceed one. 

The problem of constructing an 7i x matrix L which 
satisfies the (p, ^) -Property is a natural generalization of the 
problem of constructing the parity check matrix H of a linear 
[n,k,d > p + l]q code. Indeed, H is a parity check matrix 
of an [n,k,d > p + 1]^ code if and only if every set of 
p columns of H is linearly independent. Equivalently, any 
nontrivial linear combination of at most p columns of H has 
weight at least one. For comparison, L satisfies the (p, 6)- 
Property if and only if any nontrivial linear combination of at 
most p columns of L'^ has weight at least 2(5 + 1. 

Some classical methods for deriving bounds on the param- 
eters of error-correcting codes can be generalized to the case 
of linear static error-correcting index codes. Below we present 
a Gilbert- Varshamov-like bound. 

Theorem 8.17: Let Vq{N,26) denotes the volume of q-ary 
sphere of radius 26 in F^ given by (|9]l. If 
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then there exists an n x iV matrix L which satisfies the (p, 5)- 
Property. 

Proof: We build up the set TZ of rows of L one by one. 
The first row can be any vector in of weight at least 2(5 + 1. 
Now suppose we have chosen r rows so that no nontrivial 
linear combination of at most p among these r rows have 
weight less than 2(5+1. There are at most 

T/,(iV,2(5) ^f':)((Z-ir 



vectors which are at distance less than 2(5 + 1 from any linear 
combination of at most p — 1 among r chosen rows (this 
includes vectors at distance less than 2(5 + 1 from 0). If this 
quantity is smaller than 17^, then we can add another row to 
the set TZ so that no nontrivial linear combination of at most 
p rows in TZ has weight less than 25 + 1. The claim follows 
if we replace r by n — 1. ■ 



Remark 8.18: If we apply Theorem 16.11 to the instance 
(m*, n, X* , /*) defined in the beginning of this section, then 
we obtain a bound, which is somewhat weaker then its 
counterpart in Theorem 18.171 namely the n x N matrix L 
as above exists if 



E 

1=1 



IX. Conclusions 

In this work, we generalize Index Coding with Side Infor- 
mation problem towards a setup with errors. Under this setup, 
each receiver should be able to recover its desired message 
even if a certain amount of errors happen in the transmitted 
data. This is the first work that considers such a problem. 

A number of bounds on the length of an optimal error- 
correcting index code are constructed. As it is shown in 
Example 14.81 a separation of error-correcting code and index 
code sometimes leads to a non-optimal scheme. This raises a 
question of designing coding schemes in which the two layers 
are treated as a whole. Therefore, the question of constructing 
error-correcting index codes with good parameters is still open. 

A general decoding procedure for linear error-correcting 
index codes is discussed. The difference between decoding 
of a classical error-correcting code and decoding of an error- 
correcting index code is that in the latter case, each receiver 
does not require a complete knowledge of the error vector 
This difference may help to ease the decoding process. Finding 
an efficient decoding method for error-correcting index codes 
(together with their corresponding constructions) is also still 
an open problem. 

The notion of error-correcting index code is further gener- 
alized to static index code. The latter is designed to serve a 
family of instances of error-correcting index coding problem. 
The problem of designing an optimal static ECIC is studied, 
and several bounds on the length of such codes are presented. 
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Appendix 

Lemma A.l: If Qu is symmetric, then the generalized in- 
dependence number of Ti, is the independence number of Q-h . 

Proof: It suffices to show that if Q-u is symmetric, then 
the set of generalized independent sets of % and the set of 
independent sets of Qu coincide. 

Let be a generalized independent set in H. If | = 1, 
then obviously H is an independent set in Q-^. Assume that 
\H\ ^ 2. For any pair of vertices i,j e H, the set 
belongs to J{T-L). By definition of J{T-L), either there is no 
edge from i to j, or there is no edge from j to i, in Q-u- 
Since Qu is symmetric, there are no edges between i and j, 
in neither directions. Therefore, H is an independent set in 

Conversely, let H be an independent set in Q-^. For each 
i £ H, since there are no edges from i to all other vertices in 
H, we deduce that H\{{} C y^. Due to ([3]), every subset of H 
which contains i belongs to J{T-L). This holds for an arbitrary 
i E H. Therefore, every nonempty subset of H belong to 
J{T-C)- We obtain that iJ is a generaUzed independent set of 

n. ■ 
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